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Method and apparatus for Compressed data Storage and Retrieval 



This invention relates to a method of, and apparatus for, compressing data and a 
method of, and apparatus for, decompressing data. The invention may be used in a 
computer graphics system, and in particular, in a computer graphics system in the 
field of 3D computer graphics and for applying detail to otherwise smooth surfaces 
through use of a 'bump mapping' algorithm, in particular one based on storing per- 
pixel surface normals in a texture. 

Blinn introduced bump mapping in "Simulation of Wrinkled Surfaces" (SIGGRAPH 
1978, pp286-292). The computed shading of surfaces, which is typically done using a 
function of the incoming directions of light rays and surface normal, i.e. the vector 
perpendicular to the surface, gives important clues as to the orientation and also 
roughness of that surface. Blinn' s bump mapping gave otherwise mathematically 
smooth surfaces the appearance of roughness (or bumps) due to changes in the 
shading caused by altering the computed surface normal on a per-pixel basis. The 
method uses texture mapping to obtain a perturbation vector to modify a surface's 
interpolated-per-pixel normal. 

Peercy et al ("Efficient Bump Mapping Hardware", SIGGRAPH 1997, pp 303-306, 
and US Patent 5,949,424) devised a more efficient method that instead directly stored 
'perturbed' normal vectors in the texture. These normals were defined relative to a 
localized tangent coordinate system. Each light vector had to be expressed in 
coordinates relative to the local tangent space coordinate system. 

Because both the size of textures and the memory bandwidth consumed during 
texturing is an important factor in computer graphics, Fenney (European Patent 
Application 98 939 751 8) describes means of reducing the storage costs of the 
surface normal, which requires three coordinate values (XYZ), to just two values thus 
saving storage space and texturing bandwidth. This method takes advantage of the 
fact that the surface normals are unit vectors defined in the local surface coordinate 
space. As shown in Figure la, the unit normals are primarily restricted to lie in a 
single hemisphere. 

As an alternative to the local tangent space system, the surface normal direction can 
be defined in the object's local coordinate space. Although this has the disadvantage 
that it is difficult (or impossible) to reuse portions of the bump texture for different 
areas of objects, it has the distinct advantages that it is cheaper to compute the 
interpolated lighting vectors and that there is no need to store per- vertex local tangent 
coordinate systems in the model. As an example, Warnes' technique (Patent 
W09527268) starts with this basis and then uses vector quantisation to make it fast. 

With the local coordinate space method, one can note that the surface normal 
directions are now arbitrarily distributed in all directions along the surface of a sphere 
of unit radius (see figure lb), unlike the local tangent space system where they are 
generally spread over one hemisphere. Although the method of surface normal 
compression described by Fenney can be trivially extended by using an additional bit 
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to choose between the hemispheres, this is not ideal, as many of the possible data 
encoding patterns are wasted. 



Although not intended for storage in bump map textures, Deering ("Geometry 
Compression". SIGGRAPH 1995, pp 13-20) presents a means of compressing a 3D 
unit vector into 1 8 bits by identifying six regions in every octant of the unit-radius 
sphere. Unfortunately, 1 8 bits is an inconvenient size for texture storage in a computer 
texturing device where the natural preferred size is typically 8 or 16 bits. Although it 
may be possible to reduce some of the precision of this method so that it does fit into, 
say, 16 bits, the method requires numerous tests as well as fairly expensive 
trigonometric functions. Because a contemporary 3D texturing system needs to be 
able to compute in the order of a billion texturing operations per second, it is 
important that these decompression operations are relatively cheap. 

The invention in its various aspects provides methods and apparatus as set out in the 
claims to which reference should now be made. 

Preferred embodiments of the invention provide a means of storing 3D unit vectors in 
a form that is both optimised for storage in a texture, and for ease of decompression in 
the texturing and shading engine of a computer 3D graphics system. They are capable 
of supporting both the local tangent space and the local coordinate space methods of 
representing surface normals. Finally, the preferred methods make more efficient use 
of the representative bits than that presented in European Patent Application 98 939 
751 8. 

Embodiments of the invention will now be described, by way of example, with 
reference to the attached figures in which: 

Figure la illustrates the range of unit normals needed for the tangent space bump 
mapping method; 

Figure lb shows the larger range required for the local coordinate space method; 
Figure 2 shows the preferred assignment of bits to a 16 bit encoding of a unit vector 
in the invention; 

Figure 3 illustrates decompression method and apparatus embodying the invention; 
Figure 4 illustrates details of method and apparatus for the step in figure 3 of 
computing the inverse of the vector length; 

Figure 5 illustrates the distribution of possible normal locations which can be 
represented in an embodiment of the present invention using two 3-bit values; 
Figure 6 illustrates the distribution of possible normal locations for a known method 
of data compression and retrieval using an approximately equal number of storage bits 
to that used for the embodiment of the invention whose distribution is illustrated in 
figure 5; and 

Figure 7 illustrates the distribution of figure 5 for just one octant pair of the sphere 
together with the corresponding grid of 3 -bit values 

The invention has two main aspects. A means of representing 3D vectors, chosen 
from a set of points on the unit radius sphere, in a compressed binary encoding 
suitable for storage in computer memory, and means of converting said compressed 
format back into 3D unit vectors. As the clearest way of describing the invention is to 
illustrate the decompression process, this will be the approach taken. 
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In the preferred embodiment, each 3D unit vector will be encoded as a 16-bit value, as 
shown in figure 2. Each value consists of 3 fields: A 2-bit' octant pair' identifier, 10, a 
7-bit U parameter, 11, and a 7-bit V parameter, 12. It must be stated that other 
choices of numbers of bits for the U and V parameters can be made without altering 
the concepts of the invention. 

Figure 3 gives an overview of the decompression apparatus. The 'octant pair', 10, is 
supplied to a decode unit, 20, that interprets the input values to produce a pair of 
numerical sign bits. (Note that this is shown merely for clarity -the encoding can be 
chosen such that the behaviour of this unit is trivial). The 'Add and Test Magnitude' 
unit, 21, adds the U and V values, 1 1 and 12, and compares the sum with 127 (half the 
maximum possible value of the sum of 7-bit U and V values). It outputs a flag based 
on the result. (In binary hardware arithmetic, this is a trivial operation). 

Unit 22 takes the pair of sign flags from 20 and the magnitude comparison result from 
21, and combines them with the U and V values, 1 1 and 12, to produce the vector, 
{X' Y' Z'}. This vector is in the direction of the unit vector but is not of unit length. 
Unit 23 computes the inverse of the length of the {X' Y 5 Z'}, and passes this to the 
scaling unit, 24. This then scales the vector to produce a unit vector result. The 
internal operation of these various units will now be described using a C-like pseudo- 
code notation. 

Unit 20, produces two sign flags, Usign and Vsign, based on the 'octant pair' 
identifier, 10, OPI. It should be appreciated that this is actually a completely trivial 
operation in hardware. 

#define IS_POS (0) 
#define IS_NEG (1) 

switch (OPI) 
{ 

case "00": 

Usign = IS_POS; 
Vsign = IS__POS; 
break; 

case "10" : 

Usign = IS_NEG; 
Vsign = IS_POS; 
break; 

case "11": 

Usign = IS_NEG; 
Vsign = IS_NEG; 
break; 

case "01": 

Usign = IS__POS; 
Vsign = IS_NEG,\ 
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break; 

}; 

Unit 21, produces a single bit flag, WO, to identify to which octant of a pair of octants 
the data belongs. This can be described as: 

If (u+v < 128) 
{ 

WO = 0; 

} 

else 
{ 

WO = 1; 

} 

Those skilled in the art will appreciate that, given the range of the input parameters, in 
hardware the comparison and assignments simple amount to OR'ing together the two 
top bits of the sum of the u and v values. 

Unit 22 produces the initial, non-unit-length vector as follows. 

If (WO==0) 

{ 

if(Usign == IS_POS) 
XI = U; 

Else 

XI = -U; 

if(Vsign == IS_POS) 
Yl = U; 

Else 

Yl = -U; 
Zl = 127 - X - Y; 

} 

else 
{ 

Xtemp = 127 - V; 

if(Usign == IS_POS) 
XI = XTemp; 

Else 

XI = -XTemp; 

Ytemp = 127 - U; 

if(Vsign == IS_POS) 
Yl = YTemp; 

Else 

Yl = -YTemp; 
Zl = - (127 - X - Y) ; 
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Unit 23 computes the inverse of the length of the initial vector as a fixed-point 
fractional binary number. It initially computes the square of the length of the vector 
by summing the squares of the components, i.e. 

LengthSQ = X1*X1 + Y1*Y1 + Z1*Z1; 

It will be appreciated that, due to the range of input bits and the calculations 
performed, the possible range of squared lengths is limited. In the preferred 
embodiment, this range is 5377 to 80645 inclusive. 

The inverse square root of this squared length is then computed in a pseudo-floating 
point format, using a method known in the art. With reference to figure 4, this 
calculation can be done using a normalizing shifter, 50, to shift the input an even 
number of bits so that the most significant bit is in either one of the two most 
significant locations. The most significant 10 bits of the result, 51, effectively 
represent a fixed point number in the range [1,4), and are then used to access a lookup 
table, 52. This lookup table returns a fixed point result in the range (0.5,1] 
corresponding to the inverse square root of the input. This result combined with the 
shift amount divided by two, 53, then constitutes the inverse square root of the 
original squared sum. 

Finally, Unit 24 just multiplies the pseudo floating point inverse (i.e. corresponding to 
a multiply followed by shifts and truncates) by each of the X', Y\ and Z' components 
to obtain the normalized result. 

The opposite of the decompression process, i.e. the compression, is essentially the 
described process run in reverse. A unit vector, V, is analysed and the signs of the 
X,Y, and Z components determine which octant the unit vector lies in. The vector is 
then scaled, to produce VI, so that the VI [Z] - 127 - |V1 [X]| - |V1 [Y]|. The U and V 
components can then be computed from the identified octant. 

For illustrative purposes, figure 5 displays the distribution of normal locations for an 
example embodiment where U and V are only assigned 3 bits each. As a comparison, 
figure 6 shows the distribution of points for the adapted version of the method 
described previously by Fenney, using an approximately equivalent number of storage 
bits. As can be seen, the storage is not as even, nor as dense. 

Figure 7 shows the distribution for just one octant pair of the sphere with the 
corresponding grid of U and V values. 

It should be noted that this technique is also useful to other applications which require 
reduced cost storage of unit vectors. The compression of vertex geometry, as 
described by Deering, is one such application. 
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1 

Claims: 



1) A method of retrieving a 3D unit vector from compressed data, the data 
consisting of three fields, the method comprising the steps of: 

a) identifying one of four octant-pairs from the data stored in the first field; 

b) extracting a first and second values from the remaining two fields 

respectively; 

c) comparing sum of said first and second fields, and if said sum is less than 54 
of the maximum possible value, selecting the first octant of the octant pair, 
otherwise selecting the second octant of the pair; 

d) computing initial X, Y, and Z coordinates based on the choice of octant; 

e) normalising said X,Y, and Z values to form a unit vector. 

2) A method of performing bump mapping in which localized surface normals 
are stored and retrieved using the method of claim 1 . 

3) An apparatus performing the steps of claim 1. 

4) A texturing apparatus performing the steps of claim 2. 

5) Data compression and/or decompression apparatus as hereinbefore 
described with particular reference to figures 2 to 5, and 7. 

6) A method of data compression and/or decompression as hereinbefore 
described with particular reference to figures 2 to 5, and 7. 
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